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ABSTRACT 

Given the enormous galaxy databases of modern sky surveys, parametrising galaxy 
morphologies is a very challenging task due to the huge number and variety of objects. 
We assess the different problems faced by existing parametrisation schemes (CAS, 
Gini, M20, Sersic profile, shapelets) in an attempt to understand why parametrisation 
is so difficult and in order to suggest improvements for future parametrisation schemes. 

We demonstrate that morphological observables (e.g. steepness of the radial light 
profile, ellipticity, asymmetry) are intertwined and cannot be measured independently 
of each other. We present strong arguments in favour of model-based parametrisation 
schemes, namely reliability assessment, disentanglement of morphological observables, 
and PSF modelling. Furthermore, we demonstrate that estimates of the concentra- 
tion and Sersic index obtained from the Zurich Structure & Morphology catalogue 
are in excellent agreement with theoretical predictions. We also demonstrate that the 
incautious use of the concentration index for classification purposes can cause a severe 
loss of the discriminative information contained in a given data sample. Moreover, 
we show that, for poorly resolved galaxies, concentration index and M20 suffer from 
strong discontinuities, i.e. similar morphologies are not necessarily mapped to neigh- 
bouring points in the parameter space. This limits the reliability of these parameters 
for classification purposes. Two-dimensional Sersic profiles accounting for centroid and 
ellipticity are identified as the currently most reliable parametrisation scheme in the 
regime of intermediate signal-to-noise ratios and resolutions, where asymmetries and 
substructures do not play an important role. We argue that basis functions provide 
good parametrisation schemes in the regimes of high signal-to-noise ratios and res- 
olutions. Concerning Sersic profiles, we show that scale radii cannot be compared 
directly for profiles of different Sersic indices. Furthermore, we show that parameter 
spaces are typically highly nonlinear. This implies that significant caution is required 
when distance-based classificaton methods are used. 

Key words: Galaxies: general - Methods: data analysis, statistical - Techniques: 
image processing. 



1 INTRODUCTION 

In the last ten years the field of galaxy evolution has experi- 
enced a boost. With the advent of large ground-based spec- 
troscopic and imaging surveys such as the SDSS (lAbazajian 



et al. 12009 1 or space-based surveys like COSMOS (Scoville 



et al.| 20071, the database of galaxies has increased enor- 
mously. From both very deep as well as very wide area 
surveys substantial amounts of data are available, enabling 
us to study the dependence of galaxy formation and evo- 
lution on e.g. environment, star formation history or stel- 
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lar/bulge/black hole mass. It is now possible to test mul- 
tivariate dependencies and, in conjunction with numeri- 
cal simulations, to describe possible evolutionary tracks of 
galaxies, and to single out not yet fully understood phenom- 



ena like the colour-bimodality of galaxies (e.g. Strateva et al 
|2001[ ) or the linear relation between black hole and stellar 
bulge mass (e.g.|Haring fc Rix||2004| |Woo et aL]|2006l. 



Studies of galaxy morphologies are very important in 
this context, because different morphologies are caused by 
different physical processes that are likely to also affect other 
properties, e.g. star-forming rate, and may also correlate 
with environment. Despite these efforts, it is still a very chal- 
lenging task to meaningfully describe (parametrise) the mor- 
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phologies of galaxies in very large data samples. Although we 
are well able to parametrise the morphologies of individual 
galaxies of certain types (e.g. Simmat et al.|20"l0 |, finding a 
parametrisation scheme that is able to account for the huge 
variety of galaxy morphologies is a completely different task. 



1.1 Strategy 

In this paper we discuss the concept of parametrisation 
and summarise commonly used parametrisation schemes, 



name ly CAS \ Abraham et al.||1994| |1996[ |Bershady et al 
2000}, M 20 (|Lotz et al.||2004|), Gini (|Lotz et al.||2004 



2008), Ser sic profile (|Sersic||19"68} |Graham fc Driver 



shapelets (Refrcgicr 2003) and sersiclets (Ngan et al 



2005), 



2009). 



We categorise these schemes and identify important dif- 
ferences. However, the main intention of this article is to 
determine if there are any fundamental problems involved 
in the parametrisation of galaxy morphologies, which may 
turn out to be subtle or non-obvious. Our investigations 
are designed to test the current paradigm favouring model- 
independent schemes. It has already been shown that the 
diagnostic power of shapelets is limited for elliptical galax- 
ies ( Melchior et al.| [2010), whereas the method of sersiclets 
has not yet been successfully established. Therefore, we fo- 
cus our attention on the caveats involved in the usage of the 
other parametrisation schemes. In the course of this investi- 
gation, we demonstrate that morphological observables are 
intertwined. This new insight implies that all schemes that 
try to estimate observables separately without addressing 
their inherent degeneracies are problematic in principle. 

In the remaining part of this introduction, we de- 
fine the terms "galaxy morphology" and "parametrisation" 
and discuss what parametrisation is meant to achieve. In 
Sect. 2 we introduce two conceptually different approaches 
to parametrisation, namely model-independent (CAS, M20, 
Gini) and model-based schemes (Sersic profile, shapelets, 
sersiclets). As a first fundamental problem and one of our 
main results, we illustrate in Sect. 3 that morphological 
observables are intertwined and cannot be measured inde- 
pendently. Second, we investigate the impact of the point- 
spread function on the concentration index in Sect. 4. Third, 
we consider general problems affecting the classification of 
galaxy morphologies in Sect. 5. Finally, in Sect. 6 we sum- 
marise our results and give recommendations for improve- 
ments of existing or the design of new parametrisation 
schemes. 



1.2 Galaxy morphology 

The morphology of a galaxy is defined by the characteristics 
of its two-dimensional light distribution, i.e. by the projected 
shape of the galaxy. Some morphological observables are: 

• steepness of radial light profile 

• ellipticity (i.e. orientation & axis ratio) 

• asymmetry (e.g. lopsidedness) 

• substructures (e.g. spiral arm patterns, bars, etc.) 

• size 

• centroid 

The centroid position is an important morphological ob- 
servable as well, since it is often required to derive other 



morphological estimators (cf. Table [T}. For decades galaxy 
morphologies have been studied in the visual regime, where 
all these observables are reasonably well defined. However, 
with increasing observational coverage of the electromag- 
netic spectrum, it became evident that morphology is a 
strongly varying function of wavelength. For instance, in 
the UV we observe mostly star-forming regions but no dust 
emission, such that galaxies can look patchy and highly ir- 
regular. On the other hand, in the far infra-red, there is 
almost no stellar but only dust emission. As we discuss in 
Sect. |2.4| many parametrisation schemes for galaxy mor- 
phologies make rather restrictive assumptions that are too 
specialised on the visual regime and cannot be generalised 
to the whole electromagnetic spectrum. As our discussion is 
set in the context of large surveys where galaxies exhibit a 
huge variety of different morphologies, we have to look for 
parametrisation schemes that are flexible enough to describe 
arbitrary morphologies. 

1.3 Observation, parametrisation, inference 

In this section we want to clarify the role of parametrisa- 
tion, i.e. what purpose it serves and what its benefits are. 
Parametrisation is one step in the sequence of observation, 
parametrisation and inference, which is visualised in Fig. [T] 

The process of observation (J-i) provides a nonlinear 
mapping of the true intrinsic galaxy morphology to an ob- 
served morphology. This mapping T\ comprises the projec- 
tion onto the two-dimensional sky, the binning to pixels, the 
addition of pixel noise, and the convolution with the pixel- 
response function (gain of the detector). It also involves the 
convolution with the point-spread function, taking into ac- 
count seeing effects, optics and instrument sensitivity. 

However, analysing galaxy morphologies directly in 
pixel space is infeasible, since the number of pixels is typ- 
ically very large. Therefore, it is necessary to parametrise 
the observed morphology [Ti in Fig. jl|, a step that has 
the two following aims: First, we want to reduce the de- 
grees of freedom, since there is a lot of redundant or un- 
interesting information in pixel space. Second, we want to 
move from pixel space to some other description that bet- 
ter suits a given physical question. Effectively, this means 
that parametrisation can act as a method to reduce the di- 
mensionality of the problem, to suppress noise and to extract 
information. Note that this definition of parametrisation en- 
compasses more than just data modelling. 

Based on such a parametrisation we can then try to 
infer the true intrinsic morphology. For instance, inference 
can be based on the search of multivariate dependencies of 
morphological descriptors on physical parameters or on clas- 
sification. The inference step corresponds to the mapping 
T:i in Fig. [I] where obviously T:>, = J-^ 1 o i.e. both 

mappings T\ and Ti need to be invertible - at least in a 
practical sense. Often inference does not aim at the true in- 
trinsic morphology, but at some abstract type or class that 
represents a reasonable generalisation. Still, if either T\ or 
J-2 destroys too much information, this type of inference is 
impossible as well. For J-i being (approximately) invertible, 
the observation has to have a high signal-to-noise ratio and 
a high resolution relative to the features of interest (critical 
sampling). If this requirement is not met by the data, the 
observation will not resemble the true morphology and infer- 
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Figure 1. Interplay of observation (Fi), parametrisation (F2) 
and inference (J-3). This paper is concerned with the existence 
of the mapping JF^ 1 , which is necessary for J-3 = J 7 ^ 1 o 1 to 
exist. 



ence will be impossible. Bamford et al. (20091 observe this 



problem in the Galaxy Zoo project and term it "classifica- 
tion bias" . They noticed that type fractions resulting from 
visual classifications of 557,681 SDSS galaxies with redshifts 



z < 0.25 evolve significantly with z. As Bamford et al. ( 2009 1 
do not expect a pronounced morphological evolution in this 
redshift regime, they assign this effect to the degradation of 
image quality with increasing redshift. 

If Ti is invertible - i.e. whether or not T% and thus 
J-3 = J 7 } 1 o T2 1 exists - depends on the parametrisation 
scheme. This is the topic of this paper. Consequently, a re- 
liable parametrisation is as important for inference as suffi- 
cient data quality. 



2 PARAMETRISATION SCHEMES 

In order to assess the advantages and deficites of differ- 
ent parametrisation schemes we now briefly summarise the 
most common approaches. We divide them into model- 
independent and model-based approaches. The most im- 
portant difference is that the model-based approaches try 
to model the two-dimensional light distribution of an im- 
age and are thus mostly descriptive. Model-independent ap- 
proaches more directly try to extract physical information, 
hence mixing decription and inference steps. We conclude 
this section by summarising the assumptions involved in the 
parametrisation schemes. 



2.1 Model-independent schemes 

The foremost reason to use a model-independent - or "non- 
parametric" - approach is that it appears to be very simple 
at first glance. Most of these parametrisation schemes seem 
easy to implement, since they do not require to fit a model. 
Furthermore, parameters in all these schemes have at least 
a rough physical interpretation. 



et al.||1994||1996||Bershady et aLp OOO). The concentration 
index is defined as 



C = 5 lo gl 



I'M) 
'•20 



(1) 



where rso and r2o are the radii of circular (or elliptical) 
apertures containing 80% and 20% of the total image fluxrj 
The asymmetry index is defined as 



A = 



pixels 



\I(x) 



Epixels I\ X ) 



(2) 



where J 180 denotes the image / rotated by 180°. Obviously, 
the asymmetry A is bound in the interval [0, 2]. Finally, the 
dumpiness is defined as 



s = 10- 



(3) 



^-/pixels ^("^) 

where I" has been convolved by a Gaussian of width a. The 
specific choice of a is somewhat arbitrary within a certain 
range, being sensitive to clumps of varying spatial extent. 
As far as we know, there is no systematic investigation of 
the impact of the choice of a on the parametrisation results. 

2.1.2 M20 and Gini 

Two further morphological parameters are M20 and the Gini 
coefficient. We define the second-order moment of pixel n 
with value I n at position ( |Lotz et al.|2004[ ) 

M n = /n {Xn - X c f , (4) 

where x c denotes the reference position. Summation of the 
M n over all pixels yields the total second moment Mtot with 
respect to x c . There is a theoretical preference to choose the 
reference position x c to be the centre of light, because this 
choice minimises Mtot- M20 is defined as 



A/20 = log 



Alt. 



(5) 



where the summation JT\ Mi is over the pixels in descending 
order Ji I2 . . . In an d stops as soon as Y2i If ^ 
0.2 £^=1 In, i.e. as soon as 20% of the total flux is reached. 
M20 is supposed to estimate the spatial distribution of the 
most luminous parts of a galaxy image. 



The Gini coefficient was defined by Lotz et al. 
[2008] ) based on |Glasser| p9p2] | as 

^_Eli(2n-iV-l)|/„| 



(2004 



(at-i)e; 



(6) 



where iV is the number of image pixels and |7i| ^ \I2\ ^ 
... ^ I In I are the absolute values of the pixel fluxes sorted 
in ascending order. In contrast to M20, Gini does not require 
an estimate of the centroid position. The Gini coefficient 
estimates the distribution of the pixel values over the image. 



As shown by Lisker ( 2008 1 , it strongly depends on the signal- 
to-noise distribution within a galaxy's image and is thus a 
highly unstable morphological estimator. 



2.1.1 CAS system 

A widely used set of morphological parameters is provided 
by the CAS system, which is based on the so-called Con- 



centration, Asymmetry and dumpiness indices (Abraham 



1 There are several variations of the concentration index: Some- 
times it is based on the ratio of rgo and r$Q. Some authors (e.g. 
|Bershady et al.|2000| consider t he whole image for estimating C, 
others (e.g. |Scarlata et al.|2007| ) estimate C within a region given 
by one Petrosian radius. 
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2.2 Model-based schemes I: Sersic profile 

2.2.1 Definition 

The radial light profiles of many galaxies are reasonably well 
described by the Sersic profile (see Sersic 1968; Graham & 
Driver|2005 for a compilation of relevant formulae), 



I(R) = 1/3 exp < -b n 



l/nc 



(7) 



where ns is the Sersic index and /3 is the scale radiu:^] The 
constant b n is usually chosen such that the radius /3 encloses 
half of the total light. Ip is the intensity at the half-light 
radius /3. At fixed ns, b n is then given by 



T(2ns) = 2 7 (2n s ,&„) 



(8) 



where F and 7 denote the complete and incomplete gamma 
functions. For ng > 0.5 one can approximate b n ~ 2ns — 
|. The Sersic profile corresponds to a Gaussian profile if 
ns = 0.5, to an exponential disk profile if ns = 1, and to a 
deVaucouleur profile if ns = 4. 

Throughout this paper we use a truncated Sersic profile 
of the form 



I(R) 



I(R) 




7(5/3) 



otherwise 



R ^ 5/3 



(9) 



such that all profiles are for R > 5/3 but still continuous. 
This is necessary, since otherwise the profiles do not vanish 
quickly enough for large Sersic indices. 

2.2.2 Redefining b„ and /3 

It is important to note that b n and /3 in Eq. |7| are com- 
pletely degenerate. We are free to make any choice of b n that 
is different from Eq. Q, thereby redefining the model and 
changing the meaning of /3. There are two reasons why Eq. 
(JsJ> potentially is not a good choice for b n : 

(i) From a theoretical point of view, half-light radii /3 are 
not comparable for different values of ns, i.e. the size of one 
galaxies relative to a second galaxy can be inferred from 
their scale radii if and only if both Sersic models use iden- 
tical ns- However, in practice this is rarely a problem, since 
studies usually compare only sizes of galaxies of similar Hub- 
ble types, e.g. in studies of the size-evolution of disc galaxies. 
Nonetheless, choosing b n according to Eq. Q, we must not 
demand that /3 is smaller than the image size, since /3 cannot 
be interpreted this way. We actually need to require that the 
profile drops within the image boundaries. Figure [2] shows 
that the radii where the profile reduces to =, \, and of 
its value at r = vary over several orders of magnitude for 
different ns- The scale radius /3 is more intuitively defined 
such that 



1(0) 



= 1/X 



(10) 



for some X > independent of ns- This can be achieved by 
setting b n = b = logX for all ns- Panel (b) of Fig. [2] shows 
that in this case the radii for different ns change by less 



2 The scale radius /3 is expressed in units of pixels, i.e. /3 1 is the 
pixel size relative to the object size. 




Figure 2. Radii rx where Sersic profile takes values 
I{r x )/I{0) = l/X for X = 2, 4, 10 and b n given by Eq. ^ (panel 
(a)) and b n = log 4 (panel (b)). 



than two orders of magnitude and hence can be compared 
much better. 

(ii) It is well known that there is a strong correlation of 
ns and /3 (e.g. Trujillo et al.|200T I , which is problematic for 
many fit algorithms. This correlation of ns and /3 is almost 
completely induced by Eq. fl8j, i.e. it is artificial. We can 
remove this correlation by setting b n = logX for all ns, 
thereby simplifying the fit problem. We demonstrate this 



in Fig. [3] showing x 2 manifolds for fitting an artificial light 
profile once using Eq. Q (panel a) and once using b n = 
logX for all ns (panel b). The noise level in this simulation 
is low (the signal-to-noise ratio of the central peak is 100). 
Higher noise levels will not change the curvatures of the \ 2 
"valleys" in Fig. [3] but will only broaden them and reduce 
their depth. 

These issues are not fundamental and there is no theoretical 
preference for choosing between these approaches apart from 



the fact that Eq. ( 10 1 is likely to provide more robust param- 
eter estimates. Furthermore, it is possible to convert to and 
fro the definitions of Eqs. dSp and ( 10 1 via 6//3 1/ns = const. 



2.3 



Model-based schemes II: Expansion into basis 
functions 



An alternative model-based parametrisation approach is the 
expansion into basis functions. The most important advan- 
tage of this concept is that the parametrisation is more flex- 
ible, whereas all previous schemes are highly specialised for 
certain morphologies. A good set of basis functions should 
be able to fit almost anything, provided the signal-to-noise 
ratio of the given data is sufficiently high. Hence, this ap- 
proach should in principle be favoured when the task at hand 
is to parametrise arbitrary morphologies. 

Basis-function expansions are very common in physics 
and also in cosmology (e.g. decomposing the CMB into 
spherical harmonics). Usually, the basis functions are cho- 
sen based on symmetry arguments or best as eigenfunc- 
tions of the differential equations describing the underlying 
physics. However, we do not know the physics governing 
galaxy morphologies yet, hence there is no theoretically mo- 
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Figure 3. x 2 /dof manifolds demonstrating how Eq. ^ induces 
the artificial correlation of ng and /3. 

(a) x 2 /dof manifold for b n defined by Eq. QSp . The white dia- 
mond indicates the optimum. The dashed white line is given by 
bn/fi 1 '™ 3 = const and follows the valley, thereby illustrating that 
the correlation of ng and f) is artificial. 

(b) Same as in (a) but for b n = log 4 for all ng. The valley is 
approximately parallel to the rag-axis, i.e. the correlation is gone. 
Both panels use the same artificial light profile with low noise 
level to evaluate the x 2 /dof manifold. It is much easier to find 
the optimum in panel (b) than in panel (a). The optimal values 
of ng are identical in (a) and (b), whereas the optimal values of 
j3 are different due to the different choice of b n . x 2 /dof is not a 
simple quadratic form, because the Sersic profile is a nonlinear 
model. 



tivated choice for the set of basis functions. Therefore, basis 
functions are chosen such that they possess advantageous 
analytic properties or overcome special problems. 

In the following, we introduce the concept of basis func- 
tions. We briefly comment on the issues of orthonormality 
and completeness and then discuss example sets of basis 
functions. 



2.3.1 General concept 

A set of basis functions is usually defined such that it is 
orthonormal and complete. However, we want to introduce 
this concept in a sligthly more general fashion. Consider a 



set of N scalar-valued functions {B\(x; 8i), . . . , Bn(x; On)}, 
where x denotes the two-dimensional pixel-position vector 
and 9 n is the set of parameters of the n-th basis function 
B n . The basis functions may be nonlinear in both x and 8„. 
We consider the linear superposition, i.e. the model, 



/(£) = ^2c n B n (x;i 



(11) 



with the ./V expansion coefficients c n . These coefficients are 
further model parameters in addition to 8„. The c„ enter 
Eq. (Ill linearly, hence they form a linear space, i.e. a vector 
space. Therefore, the set of TV coefficients is also referred to 
as "coefficient vector" c. Given an observed galaxy image 
I{x), we can fit the model f(x) to this image. The details 
of the fitting process will depend on the choice of the set of 
basis functions. The fitting process itself is also called the 
"decomposition of the image into the basis functions" . 

After fitting the model defined by Eq. ( |11[ ) to the image, 
we obtain estimates for the coefficients c n and the param- 
eters 8 n for all basis functions. Usually, the 8 n are used to 
incorporate several effects. For instance, there is typically 
a size parameter that scales the spatial extent of the basis 
functions such that the coefficients c„ do not depend on the 
size of the object. If this is the case, then the basis func- 
tions are called "scale invariant" . The centroid position can 
also be part of 8 n . The linear coefficients c n are supposed to 
capture the morphological information. 



2.3.2 Orthonormality and completeness 

As aforementioned, sets of basis functions are often or- 
thonormal and complete. The orthogonality would ensure 
that all coefficients were completely independent of each 
other. The completeness would allow us to decompose an 
arbitrary image. In practice, however, the completeness is 
lost due to pixel noise and pixellation, which sets an upper 
limit to the number of basis functions that can be used to 
decompose a given image. This can lead to characteristic 
modelling failures. We discuss this in slightly more detail in 
the next section. The strict orthogonality is also lost, due to 

This means that the re- 



pixellation (Melchior et al. 2007) 



suiting coefficients may exhibit minor correlations, but if the 
galaxy image and all basis functions are critically sampled, 
these correlations will be negligible. 



2.3.3 Shapelets 



Shapelets were introduced by Refregier (20031. They are a 



scaled version of Gauss-Hermite polynomials, i.e. 



B n (x;(3) = (2 n n\^Py 1/2 H, 





■ x 2 ■ 


(f ) exp 


2/3 2 _ 



(12) 



where H n denotes the Hermite polynomial of order n and /3 
is the shapelet scale size. A centroid can be introduced via 
x — > x — xq. In this case, all basis functions take identical 
parameters 8 n = 8 = (xo, /?) in order to allow for orthogonal- 
ity. From this definition, we can build two-dimensional basis 
functions, namely Cartesian shapelets and polar shapelets. 

The Gaussian weight function of shapelets leads to very 
nice analytical properties. For instance, shapelets are nearly 
invariant under Fourier transformation, which makes any 
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convolution or deconvolution a closed and analytic opera- 



tion in shapelet space, as described in Melchior et al. ( 2009 1 



However, the limitation of basis functions due to pixel noise 
has a severe consequence: Shapelets employ a Gaussian 



weight function (cf. Eq. ( 12 1 ) , but real galaxies have typ- 
ically much steeper profiles. This gives rise to characteristic 
modelling failures that typically manifest themselves in ring- 
like artifacts in the shapelet reconstructions of galaxies with 
exponential or steeper light profiles. This severly limits the 
diagnostic power of shapelets (cf. Melchior et al.pOlO I and 
we therefore exclude them from our subsequent simulations. 

Despite these fundamental problems, shapelets demon- 
strate a very important aspect of basis-function expansions: 
For highly resolved galaxies of high signal-to-noise ratios 
Sersic profiles are incapable of providing excellent models 
as they are not flexible enough to account for substructures 
such as spiral arm patterns, i.e. their residuals do not al- 
ways reach noise level. In case of shapelets - as an example 
of basis functions - this is fundamentally different. They are 
highly flexible and reach noise level even for galaxies that 



are very large, highly resolved and bright (e.g. Andrae et al 



2010) 



2.3.4 Sersiclets 

Given the problematic impact of the Gaussian profile on 
shapelets, a set of basis functions based on the Sersic profile 
is an obvious means to overcome the limitations of shapelets. 
The resulting basis functions are called sersiclets. |Ngan et al.| 



1 2009 ) were the first to realise the potential of this approach, 



which is capable of accounting for all morphological observ- 
ables listed in Sect. |1.2] However, for technical reasons their 
implementation of sersiclets was flawed, as we illustrate in 
an upcoming paper (Andrae et al. in prep. I. We therefore 



also exclude sersiclets from our simulations. 



2.3.5 Outlook: Template libraries 

We already argued that no basis set - apart from the pixel 
grid itself - is actually complete due to the limitations in- 
duced by pixel noise. Now, we want to briefly touch - with- 
out going into details - on a set of basis functions that is 
finite and thus incomplete from the beginning. The motiva- 
tion is very simple: For both shapelets and sersiclets the ba- 
sis functions lack a physical interpretation. Why not use ba- 
sis functions that directly correspond to spiral arms, galactic 
bars or rings? We can use a set of such templates - a tem- 
plate library - to form linear models and decompose the im- 
age, resulting in a set of coefficients that form a vector space. 
The individual templates do not even need to be orthogo- 
nal, but just as linearly independent as possible in order to 
avoid heavy degeneracies during the fitting procedure. Un- 
fortunately, the direct physical motivation is also the major 
drawback of this approach, since we are strongly prejudiced 
and lack flexibility in this case. For instance, template li- 
braries are likely to have severe problems in decomposing 
irregular galaxies, i.e. they are inappropriate for parametris- 
ing arbitrary morphologies. Moreover, the set of morpholog- 
ical features is very large, hence such a library has to contain 
numerous templates. 



2.4 Assumptions 

It is crucial to be aware of all assumptions made by a cer- 
tain method when using it, since if a method fails, it usually 
fails because one or more of its assumptions break down. In 
case of model-based approaches, the assumptions are usu- 
ally rather obvious and therefore can be easily challenged. 
In contrast to this, the assumptions of model-independent 
approaches are implicit and often hidden. This may lead to 
the misapprehension that model-independent schemes were 
superior since they required fewer or even no assumptions. 

In Table [l] we summarise our categorisation of 
parametrisation schemes. Based on this table and the defi- 
nitions given in the previous sections, we now work out the 
assumptions of all schemes from a theoretical point of view. 
In practice, it is virtually impossible to satisfy all assump- 
tions. Whether the violation of some assumption leads to 
a breakdown of a certain method depends on the specific 
question under consideration, the desired precision, the de- 
tails of the method's implementation, and the quality of the 
data. In detail, the assumptions are: 

• Concentration index: There are no azimuthal structures 
such as spiral-arm patterns or galactic bars [^] The pixel noise 
is negligible and the object is not grossly asymmetric such 



3 This is a mathematical and deeply implicite assumption that is 
generally not realised when working with actual galaxy data: The 
"radii" used to compute the concentration index are estimated 
from a curve of growth. This curve of growth is actually a two- 
dimensional integral over the galaxy's light profile (though it is 
usually reduced to a summation due to pixellation to allow a com- 
ment on an actually irrelevant practical detail). Nevertheless, it is 
inevitable to parametrise this integration in some way in order to 
be capable of evaluating it (analytically or numerically). In simple 
words, one has to define what "radius" means (e.g. spherical or el- 
liptical radius) and this definition is the assumption. For instance, 
assuming spherical integration contours, the curve-of-growth in- 
tegral of an image f(r,ip) reads p(R) = f^drr dip f(r, (/?), 
where the integral has been parametrised in polar coordinates. 
In fact, Figure [5] can be understood as investigating what hap- 
pens if the curve of growth indeed takes this spherical form but 
the image data is not spherically symmetric but elliptical. More 
physically, though already beyond the point: In case of an image 
that has perfectly circular or elliptical symmetry, the azimuthal 
integration in p(R) is well denned and so are the radii and the 
concentration index. However, if there is more complicated az- 
imuthal structure than ellipticity, there is no simple way to gen- 
erally define the curve of growth. Either, the integration is along 
true isophotes. In this case, the shape of the integration regions 
will vary from object to object and potentially also with radius. 
Then the resulting concentration indices would not be compa- 
rable. The other option is to integrate along given circular or 
elliptical isophotes to enforce comparability. This approach ex- 
plicitcly assumes that there is no azimuthal structure or else the 
radius in p(R) has no strict relation to the galaxy, and the es- 
timated curve of growth will be biased. The justification to use 
this in practice is to assume that in reality objects of similar type 
will catch similar biases, such that concentration indeces still have 
discriminative power in a differential sense, though their absolute 
values may be biased. Furthermore, the mere presence of such a 
bias does not automatically imply that the resulting estimates of 
the curve of growth and the concentration index, respectively, are 
not meaningful anymore. 
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that a centroid is well defined (cf. Sect. 3.3 1. The scheme 



can be enhanced using elliptical apertures. 

• Asymmetry index: A centre of rotation is well defined. 
The pixel noise is negligible. Both issues have been addressed 
by Conselice et al. (20001. The asymmetry of interest is vis- 



ible under rotations of 180°. 

• dumpiness index: The functional type of the kernel 
matches the galaxy profile. The width of the kernel is cho- 
sen such that the information of interest is extracted. The 
cllipticity of the kernel matches the ellipticity of the object. 

• M20: The pixel noise has negligible impact on the esti- 
mates of centroid and second moments. The centre of light 
and the object's centre coincide, i.e. there is no substantial 
asymmetry. The structures dominating M20 are of circular 
shape with the centroid at their centres]^] 

• Gini coefficient: The pixel noise is negligible (see|Lisker| 



2008) 



• Sersic profile: The Sersic profile is a good match of the 
object's light profile. In particular, this means that the ob- 
ject's light profile is symmetric, monotonically decreasing 
and the steepness is correctly described by the model, and 
there are no azimuthal structures such as spiral arm pat- 
terns, galactic bars or rings. 

• (Spherical) shapelets: Employing the Gaussian weight 
function fits galaxy profiles. Using spherical basis functions 
that have no intrinsic ellipticity does not lead to problems. 

We now clearly see that model-independent schemes im- 
plicitely make assumptions, too. This list suggests that non- 
parametric approaches tend to invoke fewer assumptions 
than model-based scheme^] at the loss of reliability, as we 
are going to demonstrate in the following sections. We also 
want to emphasise that shapelets - as an example of basis 
functions - can describe asymmetries. 



3 INTERTWINEMENT OF 

MORPHOLOGICAL OBSERVABLES 

The basic idea of model-independent schemes is to estimate 
the different morphological observables listed in Sect. |1.2| 
independently of each other, thereby simplifying the prob- 
lem. However, in this section we present as one of our main 
results the fact that these morphological observables are in- 
tertwined, which means that it is impossible to measure 
them independently of each other. Even if we try to mea- 
sure only a single observable using a method unaware of 
the other observables, the mere presence of these observable 
features will influence the results. The notion of intertwine- 
ment should not be confused with redundancy, e.g. Sersic in- 
dex and concentration index are perfectly redundant (Sect. 
3.1 1 but asymmetry and concentration index are not (Sect. 
3.3 1. Of course, for some observables the intertwinement is 
stronger than for others. This intertwinement is not of phys- 
ical origin but stems from the fact that usually all morpho- 



:nac: 



4 This assumption stems from the term (x„ — xq) 2 in Eq. 

5 However, it is not true in general that model-independent 
schemes invoke fewer assumptions than model-based approaches. 
As an exception to this "rule" , compare concentration index and 
shapelets. 



logical observables are present simultaneously, such that the 
assumptions listed in Sect. |2.4] are never truly satisfied. 

We carry out noise-free simulations of the different 
parametrisation schemes and by doing so we reveal several 
systematic misestimations - in particular of the concentra- 
tion index. All simulations invoke Sersic profiles and we want 
to explicitly emphasise that it is not necessary for real galax- 
ies to actually follow Sersic profiles Q However, as we demon- 
strate in Sect. |3.1| Sersic profiles provide parametrisations 
that are in excellent agreement with estimates of light con- 
centration. This would not be the case if Sersic profiles were 
a bad description. Pixel noise in real data may hide these 
biases to some extent, but they will still be present. 



3.1 Example I: Sersic profile vs. concentration 
index 

We begin with comparing Sersic profiles and the concen- 
tration index, establishing a relation between both schemes 
that allows us to assess systematic effects on the concentra- 
tion. The Sersic index estimates how steeply the radial light 
profile falls off. Consequently, Sersic index and concentration 
index are essentially two estimators for the same morpholog- 
ical feature, namely the steepness of the light profile. This 
is also evident from the fact that both schemes have almost 



identical assumptions (cf. Sect. 2.4|. In fact, we can com- 
pute the concentration of a two-dimensional Sersic profile 
using numerical integration, i.e., Sersic index and concen- 



tration index are perfectly redundant (see also Trujillo et al. 
20011. Integration the flux to infinite radius, Eq. |1| yields 
the power law 



C « 2.770 • r4' 466 



(13) 



which provides a good approximation for the exact numer- 
ical solution for 0.5 ^ ng ^ 7. The resulting values of 
ns = 0.5, 1 and 4 are identical to those given by |Bershady| 
(20001. Integration the flux to one Petrosian radius 



et al. 



instead of infinity, the approximate solution is 



C w 2.586 • n° s 305 



(14) 



Obviously, any declining radial profile can be mapped onto 
the concentration index this way, irrespective of whether or 
not it is a good description of a galaxy. Therefore, Fig.[4]also 
compares this theoretical expectation with the measured 
concentration indices and Sersic indices of 31,288 COSMOS 
galaxies from the Zurich Structure fc M orphology catalogue 



( Scarlataet al.|2007[[Sargent et al.|2007| ) [^Evidently, the 



dependent estimates of concentration indices conducted by 



Scarlata et al.|([2007) and of Sersic indices conducted by Sar- 



gent et aLj ( |2007[ ) are in excellent agreement with the theo- 

This clearly demonstrates that 



retical prediction of Eq. ( |14[ ) 
concentration and Sersic indices are equivalent parametrisa- 
tions in case of COSMOS galaxies, providing largely unbi- 
ased estimates. Nevertheless, this single example does not 
supersede a detailed study of potential biases that may oc- 
cur in practice. In particular, the COSMOS data shown in 
Fig. [4] exhibits a large scatter that may hide biases. 



To be more precise, it is perfectly valid to use such idealised 
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Table 1. Characteristics of parametrisation schemes. 
W We can employ elliptical isophotes to compute C. 

( 2 ' A is invariant under all operations that are symmetric under rotations by 180°. Ellipticity is such an operation. 

( 3 ) It is possible to use an elliptical Gaussian for convolution. 

( 4 ) There are spherical and elliptical shapelet formalisms. 
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Figure 4. Comparing concentration and Sersic indices of 31,288 
COSMOS galaxies from the Zurich Structure & Morphology cat- 
alogue jSargent et al.|2007| (blue points) with the numerical so- 
lution (red solid curve) and power-law fit of Eq. (oranged 
dashed curve). Shown are COSMOS galaxies with I < 22.5, valid 
axis ratios (0 < q ^ 1), and flags "stcllarity" , "junkflag" and 
"flagpetro" of 0. Concentration indices were predicted from an- 
alytic Sersic profiles using numerical integration out to one Pet- 
rosian radius. There was no pixellation. 



3.2 Example II: Steepness of light profile vs. 
ellipticity 

Our second example is the intertwinement of the steepness of 
the radial light profile and the ellipticity. These two are cer- 
tainly the most important morphological observables listed 
in Sect. |1.2| having the largest impact on parametrisation 
results. 

It is obvious that estimates of the steepness of the radial 
light profile must take into account ellipticity. Therefore, it 
is necessary to use elliptical isophotes in case of the con- 
centration index or to fit a two-dimensional Sersic profile 
that is enhanced by an ellipticity parameter. Unfortunately, 
in case of the SDSS, the aperture radii containing 50% and 



simulations to discover these biases, but in order to correct for 

them more realistic simulations are necessary. 

7 http:/ /irsa. ipac.caltech.edu/data/COSMOS/datasets. html 



90% of the total image flux given in the SDSS database are 
chosen as circular apertures ( jStrauss et al.|[2~002[ ). This im- 
plies that estimates of the concentration index drawn from 
these values may be biased. In fact, this bias was already dis- 
cussed by |Bershady et aL] ( |2000| . They investigated how the 
concentration index changes with axis ratio for samples of 
real galaxies of similar morphological types. Bershady ct al. 
(2000) claim that using circular apertures causes an over- 
estimation of concentration indices of at most 3% and is 
therefore negligible. We investigate this effect in Fig. [5] for 
a realistic range of axis ratios, as is evident from panel (a). 
Panel (b) shows how the concentration index is influenced 
by the axis ratio for Sersic profiles with fixed Sersic indices, 
co rresponding to galaxy samples of similar morphologies as 
in |Bershady et al.| | |2000| P1 Evidently, for q > 0.5 - which is 
the majority of galaxies in the given set - the bias is negli- 
gible. There are galaxies with q < 0.5, which are typically 
disc-like galaxies with shallow light profiles. For those ob- 
jects concentration estimates based on circular isophotes are 
substantially overestimated (« 30% for ns = 1)- This bias 
is not negligible. Bershady et al. ( 2000 I based their inves- 



tigation on estimated concentration indices of real galaxies. 
Hence, the most likely origin of this discrepancy in our re- 
sults is that the intrinsic scatter in the real data used by 



Bershady et al. (20001 hid this bias. Considering ellipticity 
and concentration index together - instead of using an ellip- 
tical concentration index - is not likely to solve this problem. 
The reason is that incorporating an ellipticity estimate may 
add information about the cause of the bias of the concen- 
tration index, but it does not provide information about the 
effect of this bias. Finally, we want to emphasise that Fig. [5] 
must not be used to calibrate the biased concentration esti- 
mates resulting from circular apertures. The reason is that 
this would now require Sersic profiles to be a realistic de- 
scription of galaxy morphologies. Moreover, also the study 



of Bershady et al. ( 2000 ) cannot be used for such a purpose, 



because the bias clearly depends on the intrinsic concentra- 
tion. This means that such a correction would require prior 
knowledge about the object's true concentration. 

Vice versa, Melchior et al. (20101 showed in the con- 



text of weak gravitational lensing that ellipticity measure- 
ments using shapelets are strongly biased in case of steep 



8 Obviously, Sersic profiles are rather idealised and by far not as 
realistic as the sample used by [Bershady ct al. (2000|. However, 
this does not hamper the validity of this test, but rather serves 
the purpose of isolating this bias. Apart from that, there is no 
difference in both studies. 
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Figure 5. Impact of ellipticity on concentration estimates. Panel 

(a) shows the distribution of axis ratios q = b/a for 2,272 SDSS 
galaxies from the data sample of |Fukugita et al.| l |2007| ) . Panel 

(b) shows concentration estimates using circular isophotes for el- 
liptical Sersic profiles with ng = 0.5 (solid orange line), rig = 1 
(dashed red line), ng = 2 (dotted-dashed blue line), and rig = 4 
(dotted black line). 



profiles. In other words, shapelets fail to provide reliable 
ellipticity estimates, because they do not properly account 
for the steepness of the radial light profile. This impressively 
demonstrates that these two observables may be closely in- 
tertwined. 



3.3 Example III: Impact of lopsidedness on 
centroid estimation 

As a third example for the intertwinement of morpholog- 
ical observables, we consider the impact of asymmetry on 
centroid estimates and the resulting parameter estimation 
using two-dimensional Sersic profiles. We simulate a certain 
type of asymmetry, namely lopsidedness. In order to intro- 
duce lopsidedness analytically, we apply the flexion transfor- 



mation from gravitational weak lensing ( Goldberg & Bacon 
2005 ) to the Sersic profiles as explained in Appendix |A| The 
strength of the flexion transformation is parametrised by F\ , 
F2, Gi, and G2. There is no pixel noise in this simulation. 
Figure [6] shows Gaussian profiles resulting from this trans- 
formation]^] The resulting distortions are not unrealistically 
strong. 

In Fig. [7] we investigate the impact of this type of asym- 
metry on the centroid, the asymmetry index and the con- 
centration index. The first and foremost consequence is that 
in the presence of asymmetry the maximum position and 
the centre of light as given by 



xo 



(x) 



(15) 



Sri f n 

where x n and /„ denote the position vector and value of 

9 The flexion transformation of Eq. jA10[ | will produce a second 
solution of x' = 0, which corresponds to multiple images in weak 
lensing. We only consider cutouts with just one image, where the 
other image resulting from the second solution to x' = is far 
away. 



F 1 =0.0000 



lit 



*i =0.0325 



n 



F, =0.0650 



Figure 6. Gaussian profiles of different lopsidedness. The ap- 
plied flexions are Fi = 0.0 (top left), Fi = 0.0325 (top right), 
and Fi = 0.065 (bottom). The resulting profiles exhibit realis- 
tic lopsidedness. All profiles are evaluated on a 1000x1000 pixel 
grid using a scale radius of /3 = 50. White diamonds indicate the 
maximum position. 



pixel n, do not coincide anymore. Hence, we call this spe- 
cial type of asymmetry "lopsidedness" . The centre of light 
Xco\ = (3) and the maximum position £ max coincide if and 
only if the light distribution is symmetric. As is evident from 
Fig. [7] the lopsidedness is stronger for steeper profiles, where 
the maximum lopsidedness is |a; co i — £ max |/,0 ~ 0.25. More- 
over, Fig. [7] demonstrates that, especially for steep profiles, 
estimates of asymmetry and concentration strongly depend 
on the choice of centroid. Asymmetry indices estimated with 
respect to maximum and centre of light may differ substan- 
tially in the presence of lopsidedness considering the allowed 
parameter rangep^Moreover, Fig.[7]reveals that the concen- 
tration estimated with respect to the maximum position is 
almost insensitive to lopsidedness, whereas the concentra- 
tion estimated with respect to the centre of light can be 
biased low by up to 15%. This also explains to some extent 
why the observed and predicted concentration indices differ 
in Fig. [4j because the observed concentration indices were 
estimated with respect to to the centre of light rather than 
the maximum position (cf. Scarlata et al.|[2~007 k 

We have demonstrated that the parametrisation results 
differ significantly depending on whether we use the cen- 
tre of light or the maximum position as centroid. How do 
we resolve this ambiguity? And how do we get the maxi- 
mum position in practice, when we suffer from pixel noise? 
If the parametrisation scheme was model-based, the model 
would define the centroid during the fit procedure - even 
in the presence of pixel noise. For instance, the Sersic pro- 
file should use the maximum position as centroid, whereas 
shapelets can use both maximum position or centre of light. 



10 The steps in panel (c) are due to the computation of A co \, since 
x co \ is changing as Fi increases. Whenever x co \ enters a new pixel, 
the set of pixels used for computing A co \ changes. There are also 
steps in C co i, but they are very small. 
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Figure 8. Impact of lopsidedness on real (a) and imaginary (b) 
part of Xcol- Considering ^ IXcoll < 1> the rea l part is strongly 
biased by the lopsidedness. The imaginary part is unbiased due 
to the geometry of Fi (cf. Fig. [6J . All Sersic profiles are evaluated 
on a 1000x1000 pixel grid using = 50. 



computation time is approximately of the same order as, 
e.g., fitting a shapelet model. We conclude that concentra- 
tion and asymmetry estimates are neither easy to implement 
nor computationally faster than model-based approaches. In 
case of M20, there is a theoretical preference to use the cen- 
tre of light, since it minimises the total second moments. 



Figure 7. Impact of lopsidedness on centroid (a), asymmetry 
with respect to maximum (b), absolute difference of asymmetries 
with respect to centre of light and maximum (c), concentration 
with respect to maximum (d), and relative difference of concen- 
trations with respect to centre of light and maximum (e). Lop- 
sidedness leads to a difference in maximum position and centre 
of light. Furthermore, lopsidedness creates asymmetry. Asymme- 
tries evaluated with respect to the maximum or centre of light 
can differ substantially given that A S [0,2], The concentration 
evaluated at the maximum position is almost insensitive to lopsid- 
edness. However, the concentration with respect to centre of light 
is strongly underestimated. All Sersic profiles are evaluated on a 
1000x1000 pixel grid using f3 = 50. See footnote for explanation 
of the steps in panels (c) and (e). 



However, since C, A and M20 are not model-based, we have 
to resort to convention or ad-hoc solutions. In case of the 



asymmetry index, Conselice et al. (2000) solved this problem 



by searching for the position that minimises the value of the 
asymmetry index, also considering resampling the image on 
a refined pixel grid. They were able to show that there are 
usually no local minima of asymmetry indices and hence that 
their method is stable. In case of the concentration index, 
using the maximum position appears to be more plausible 
than the centre of light, since C ma x appears to be robust 
against lopsidedness. Unfortunately, the concentration does 
not provide us with model and residuals, hence we cannot 
estimate the most likely maximum position in the presence 
of noise. However, we can apply the same ad-hoc solution 



that Conselice et al. (20001 introduced for the asymmetry 



index, by searching the position that maximises the con- 
centration estimate. Nevertheless, this method increases the 
computational effort tremendously such that the required 



3.4 Example IV: Impact of lopsidedness on 
ellipticity estimators 

As our last example, we discuss the impact of asymmetry 
on estimators of ellipticity. Again, we simulate asymmetry 
as lopsidedness as in the previous section. We apply flexion 
transformations to two-dimensional Sersic profiles without 
noise. However, we do not apply shear transformations, i.e. 
all profiles have no intrinsic ellipticity. From the pixellised 
images we then estimate the second moments of the light 
distribution, 

n _ X] n ^n{x n ,i — Xqj)(x„j — Xqj) 

where xq is the point of reference, e.g. centre of light or 
maximum position. Using the second moments, we compute 
the estimator (e.g. Bartelmann fc Schneider|2001 1 

Q22 + 2iQ 12 



X ■ 



Qn + Q22 

This estimator is related to the axis ratio via 



(17) 



^ 1 and to the orientation angle 9 via tan(2#) 



i+lxl 

. If this estimator detects any ellipticity, it will be com- 
pletely artificial, i.e. it will be a bias. 

Figure [8] shows results of this simulation. For perfectly 
symmetric profiles (Fi — 0) the estimator indeed does not 
detect any ellipticity. However, if Fi increases, the ellipticity 
estimator will be biased. The bias is stronger for steeper 
profiles. The maximum bias is 5ft(xcoi) ~ 0.13 (b/a ~ 0.877), 
which is substantial. 

We conclude from this simulation that asymmetries 
have a potentially strong impact on ellipticity estimates, i.e. 
asymmetry and ellipticity are intertwined. For instance, this 
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is relevant in case of using elliptical isophotes for estimating 
the concentration index. 



3.5 Reliability assessment 

In the previous sections we have demonstrated that some 
important morphological observables cannot be measured 
independently of one another. Given this, it cannot be guar- 
anteed that estimates of an individual observable will result 
in a parametrisation which is unbiased by the other observ- 
ables. As all the parametrisation schemes mentioned in Sect. 



2 are derived on rather restrictive assumptions (cf. Sect. 2.4 1, 
their flexibility in describing arbitrary galaxy morphologies 
is therefore limited. Consequently, it cannot be expected 
that these schemes provide accurate descriptions of all indi- 
vidual objects in a given data sample. 

Can we assess the quality or reliability of the parametri- 
sation results for individual objects, i.e., can we detect ob- 
jects where the parametrisation failed in order to sort them 
we are using a model-based parametrisation scheme 
(e.g. shapelets or Sersic profiles), the residuals of the result- 
ing best fit will provide us with an estimate of the goodness 
of fit. For instance, a very large value of \ 2 compared to 
the number of degrees of freedom indicates a poor fit, i.e. 
we should not rely on the parametrisation of this individ- 
ual object. However, if the parametrisation scheme is not 
model-based - as in case of CAS, M20 and Gini - we have 
no residuals and hence we have no way of assessing the re- 
liability for individual objects]^] 



3.6 How to disentangle observables 

As we showed above, morphological observables are inter- 
twined and cannot be measured independently. Is there a 
way to get independent estimates? 

Let us consider two morphological observables A and 
B (e.g. Sersic index and ellipticity) . Intertwinement means 
that the joint probability of A and B does not factorise, i.e. 



prob(A, _B|data) 7^ prob(v4|data) prob(B|data) 



(18) 



Using Bayes' theorem, we can rewrite the joint probability 
of A and B as 



prob(A, £>|data) 



prob(A, B) prob(data| A, B) 
prob(data) 



(19) 



where prob(A, B) denotes the prior probability of A and B, 
prob(data| A, B) is the likelihood function and prob(data) a 
normalisation factor. A model that simultaneously measures 
A and B will provide us with the likelihood function, which 
in case of Gaussian residuals is 



prob(data|yl, B) oc e 



-x 2 /2 



(20) 



11 Note that this task is completely different from testing the 
reliability using simulations. Such simulations allow to assess and 
calibrate a parametrisation scheme in general, but they do not 
help in detecting parametrisation failures for individual objects. 

12 Note that reliability assessment and error estimation are 
two different things. Error estimation is possible for model- 
independent approaches, e.g. via bootstrapping. 



We then get independent estimates of A and B via marginal- 
isation 



/ 



prob(A|data) = / dB prob(A, B|data) , (21) 



prob(Bjdata) = j dAproh(A, B|data) 



(22) 



Obviously, this only works for model-based parametrisation 
schemes, since otherwise we do not have residuals and can- 
not evaluate the likelihood function. In other words, even if 
we found a model-independent parametrisation scheme that 
accounts for all observables simultaneously, we would not 
know how to disentangle the estimates. In addition to relia- 
bility assessment, this is another strong argument in favour 
of model-based approaches. 



The marginalisation integrals of Eqs. (211 and (22 1 are 



usually very hard to evaluate, unless we use Markov-Chain 
Monte-Carlo (MCMC, e.g. |MacKay|2008| ) methods. In case 
of MCMC methods, we get those marginalisations for free, 
without any further effort. 



4 IMPACT OF PSF ON THE 
CONCENTRATION INDEX 

In Sect. 3, we introduced the notion of intertwinement that 
may systematically influence morphological parameters. An- 
other important origin of systematic effects is the point- 
spread function (PSF), as we illustrate in this section. The 
fact that parameters such as the concentration index may be 
influenced by the PSF is not new but has been long known. 



For instance, Scarlata et al. (20071 find that the PSF has 



a significant effect for objects with half-light radii smaller 
than two FWHM of the HST ACS PSF and with high Ser- 
sic index, while the effect is negligible for larger objects. In 
an attempt to overcome this bias, |Ferreras et al.| ( |2009[ ) ap- 
plied a correction to the measured concentration parameter, 
based on the half-light radius. The aim of this section is to 
reassess the impact of the PSF on estimates of the concen- 
tration index. 



4.1 Forward vs. backward PSF modelling 

In case of model-based parametrisation schemes it is stan- 
dard practice to account for the PSF by forward modelling, 
i.e. to fit a convolved model to the convolved data. In case of 
parametrisation schemes that are not model-based this is im- 
possible and we have to resort to backward PSF modelling, 
i.e. we deconvolve the data before the actual parametrisa- 
tion is done. However, deconvolution in the presence of pixel 
noise is numerically unstable, so forward PSF modelling is to 
be favoured if possible. This is another practical disadvan- 
tage of parametrisation schemes that are not model-based, 
because they need to perform either an unstable backward 
modelling or they need to invoke another ad-hoc correc- 
tion calibrated in simulations. Such simulation-based cali- 
brations introduce a further assumption into the parametri- 
sation process. Model-based schemes are much more rigorous 
in this respect, since they allow for a mathematically well- 
defined PSF treatment that does not introduce any further 
assumption. 
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Figure 9. Impact of PSF on miscstimation C—C of concentration 
index for different PSF sizes and Sersic profiles. All Sersic profiles 
are evaluated on a 1000x1000 pixel grid using J3 = 50 and b n = 
2ns ~ 1/3. With increasing PSF size with respect to the object 
size the concentration index estimated from the convolved image 
is more and more underestimated. 



4.2 Impact on concentration 



In case of the ZEST, Sargent et al. (20071 accounted for 



the PSF by forward modelling when estimating the Sersic 



index, while Scarlata et al. ( 2007 1 neglected the PSF when 



estimating the concentration index. The fact that the results 
shown in Fig.[4]are in agreement with theoretical predictions 
suggests that in the case of the COSMOS data the PSF can 
indeed be neglected for the concentration index. Therefore, 
the theoretical prediction supports the claim by |Scarlata| 
et al. (20071. Nevertheless, this single example should not 
mislead us to generalise this conclusion. It is not guaran- 
teed that the PSF will have no impact on the concentration 
index for data sets other than COSMOS that exhibit differ- 
ent signal-to-noise, PSF, and resolution. 

In order to test the impact of the PSF on the concentra- 
tion index, we generate two-dimensional Sersic profiles with 
ns=0.5,l,2,4 and convolve these profiles with a Gaussian 
kernel of increasing FWHM0 We expect that the concen- 
tration indices of very steep Sersic profiles are severly un- 
derestimated, since the PSF washes out the sharp peak. For 
lower Sersic indices this effect becomes smaller. For n$ = 0.5 
the concentration should not be affected at all, since con- 
volution of a Gaussian with a Gaussian yields a Gaussian, 
i.e. the steepness of the profile does not change. Figure [9] 
confirms our expectation. If we ignore the PSF, we can sig- 
nificantly underestimate the concentration index. 

We conclude from this test that although the PSF is 
indeed negligible in case of the ZEST, this cannot be gener- 
alised to other data sets. Consequently, a PSF treatment is 
always necessary at least when using the concentration in- 
dex. In particular concerning ground-based telescopes, the 
PSF is usually not small compared to the peak exhibited by 
highly concentrated objects. 



13 We are aware that the COSMOS PSF is not a Gaussian. This 
test is meant to demonstrate the principle of this effect. 



We now discuss the parametrisation of galaxy morphologies 
in the context of classification. First, we show that if we 
do not account for all morphological observables simulta- 
neously, the effects discussed in the previous sections can 
dilute discriminative information. Second, we show that all 
parametrisation schemes discussed here form nonlinear or 
even discontinuous parameter spaces. Third, we comment 
on the problem of high-dimensional parameter spaces. 



5.1 Loss of discriminative information 

The conclusion from our investigation of the intertwinement 
was: If a parametrisation scheme does not account for all 
morphological observables simultaneously, the results will be 
systematically altered, i.e. biased. How does this influence 
classification results? For a large sample of objects, the ori- 
gins of these systematic effects have random strength. Con- 
sequently, we have to expect an increase in the scatter of 
the resulting parameters. The sample distributions of the 
parameters will be broadened due to the additional scatter, 
i.e. peaks in the distributions are reduced and troughs be- 
tween different peaks are washed out. In other words, we are 
loosing discriminative information. 

We now demonstrate this broadening of parameter dis- 
tributions: We generate samples of two-dimensional Sersic 
profiles with fixed Sersic indices of ns = 1,2,3,4. We then 
add a random ellipticity and a random lopsidedness via the 
flexion transformation of Eq. (A10I. The flexion parame- 



ter Fi is drawn from a uniform distribution on the interval 
[—0.065,0.065]. The ellipticity is drawn from the joint dis- 
tribution of Sersic indices and axis ratios of 2,000 COSMOS 
galaxies randomly drawn from the Zurich Structure & Mor- 
phology catalogue. We then sample the Sersic profiles on 
a 1,000x1,000 pixel grid using a scale radius of /3 = 50. 
We convolve the resulting image with a Gaussian PSF of 
FWHM= 37.5 chosen such that the effects of Fig. [9] are 
present but moderate. There is no pixel noise in this simula- 
tion. From the pixellised image we then estimate the concen- 
tration with respect to the maximum position and the centre 
of light, since Sersic index and concentration are two differ- 
ent estimators for the same morphological feature. Concen- 
tration estimates also take into account elliptical isophotes, 



where the ellipticity is estimated via Eq. ( 16 1 with respect to 



the maximum position and the centre of light, respectively. 

Figure [To] shows the results of this simulation. The dis- 
tributions of concentration indices have a finite width, in 
contrast to the distribution of the Sersic indices, which are 
infinitely thin 5-peaks. Consequently, we are indeed loosing 
discriminative information. In reality this loss may be even 
more severe, since the distribution of Sersic indices has it- 
self a finite width. Moreover, Fig. [To] reveals that the loss of 
discriminative information is stronger for the concentration 
index evaluated at the centre of light. Especially for large 
Sersic indices the peaks are lowered and broadened. This is 
a strong argument to evaluate the concentration at the max- 
imum position (if it were accessible), since we conserve more 
discriminative information. In the presence of an unconsid- 
ered PSF, the parameter space is substantially biased. This 
has the advantage of reducing the width of the distributions, 
but it also shifts the different modes closer together. If the 
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Figure 10. Normalised sample distributions of concentration in- 
dices estimated with respect to (a) the maximum position of un- 
convolved image, (b) the centre of light of unconvolved image, 
and (c) the centre of light of convolved images. The modes in 
the distributions correspond to samples of 10,000 profiles each 
with fixed Sersic indices of exactly ng = 1,2,3,4 (from left to 
right). The finite widths of all modes in all distributions indicate 
the loss of discriminative information. This is particularly evi- 
dent in panel (b), where the modes of very compact objects are 
substantially broadened. All Sersic profiles were evaluated on a 
1000x1000 pixel grid using a scale radius of j3 = 50. The Gaus- 
sian convolution kernel for panel (c) was evaluate on the same 
pixel grid with FWHM= 37.5. 



distribution of Sersic indices had a finite width, this would 
wash out the troughs separating the peaks. 

This simulation demonstrates that an incautious use of 
the concentration index (ignoring asymmetries and the PSF) 
can lead to a substantial loss of discriminative information. 
In practice, this loss causes sample distributions of the con- 
centration index to be of low modality, despite the diversity 
of the galaxy population - a problem already mentioned by 
Faber et al. (20071. Consequently, the concentration index 



can only provide a lower bound on the number of classes in a 
given data sample. If the sample distribution of the concen- 
tration is unimodel, this does not imply that all objects are 
of the same type. The loss of discrimina tive information im- 
plies that the mapping from Sect. 1.3 does not always 
exist for the concentration index, i.e. drawing inference is a 
very difficult task. 



5.2 Nonlinear & discontinuous parameter spaces 

This section highlights an additional problem, which is in- 
dependent of the previous considerations. It is based on the 
fact that all parametrisation schemes discussed here are non- 
linear in the data. As a direct consequence of this, the re- 
sulting parameter spaces form nonlinear spaces, too. If the 
parameter space is nonlinear, the distance metric will be 
nonlinear, too. Although this fact may be known, it is typ- 



ically ignored in practice. Usually, the Euclidean metric is 
employed whenever a distance-based algorithm is used, e.g., 



a principal components analysis (Scarlata et al. 20071 or 
classification algorithms (e.g. Gauci et al. 2010). The cru- 
cial question is: Does ignoring the nonlinearity and employ- 
ing the Euclidean distance leads us to misestimate the true 
distances between galaxy morphologies in the parameter 
space? If so, galaxies will seem more similar or less similar 
than they actually are and hence distance-based classifica- 
tion algorithms may face serious problems. There are only 
few classification algorithms that do not rely on distances 
Fraix-Burnet et al.|2009 1. 



(e. 



5.2.1 Nonlinearity 

Let us consider a parametrisation P(I) of an image I. This 
parametrisation is said to be linear in the image data, if 



P{a I A + P Ib) = a P(I A ) + P(I B ) 



(23) 



for any two images I a and Ib and any real- valued a and j3. 
Otherwise P is nonlinear. 

We begin by considering CAS (Eqs. |l}-([3|). Apart 
from the obvious nonlinearities in C due to the logarithm 
and the ratio of radii, the computation of the radii contain- 
ing 20% and 80% of the total flux itself is highly nonlinear. 
The nonlinearities in A and S are caused by the fractions 
and absolute values in the numerators. Gini (Eq. (J6|) and 
M 20 (Eq. j§) are both nonlinear in the data, too. For both 
of them the major nonlinearity is hidden in the sorting of the 
pixel values. The Sersic model given by Eq. |7| contains the 
Sersic index and the scale radius as nonlinear parameters. 

The nonlinearity of (spherically symmetric) shapelets is 
due to the scale radius /3 and the centroid xq. Both enter 



the basis functions nonlinear ly, as is evident from Eq. ( 12 1 



The nonlinearity of shapelets has been investigated in detail 



by Melchior et al. (20071, so we do not need to elaborate 
on this here. In case of sersiclets, the Sersic index is another 
nonlinear model parameter in addition to the scale radius. 



5.2.2 Demonstration of nonlinearity of C , A & Gini 

As emphasised above, CAS, Gini, M20 and the Sersic in- 
dex are nonlinear in the data. The crucial question is: Is 
the nonlinearity severe or can we assume local flatness in 
the parameter space and use the Euclidean metric as an ap- 
proximation? In order to answer this question, we now show 
a demonstration using three Sersic profiles with different 
Sersic indices and different flexion values as shown in Fig. 
1 1 1 1 There is no pixel noise in this simulation. We perform 
a linear transformation in the image space such that two 
images I a and Is linearly transform into each other, i.e. 



7(a) = (1 - a)I A + olI b 



(24) 



where a £ [0, 1] parametrises this linear transformation. 
In reality, the superpositions of this linear transformation 
may not represent viable galaxy morphologies, e.g. a — 0.5 
for ii O I3. A proper trajectory should be a geodesic on 
the submanifold of viable morphologies. If this submani- 



fold is linear, the trajectory defined by Eq. (24 1 will pass 



through viable morphologies only. If it is nonlinear, it will 
add additional nonlinearity to this test. This means that 
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Figure 11. Two-dimensional profiles with different asymmetries 
used for demonstration of nonlinearity. All objects are evaluated 
on a 1,000x1,000 pixel grid with scale radius /3 = 50. No intrin- 
sic ellipticity was applied. All maximum positions are identical. 
Profile I± (top left) has flexion Gi = 0.1 and n$ = 0.5. Profile I2 
(top right) has flexion F\ = 0.05 and n$ = 1. Profile ^3 (bottom) 
has flexion Gi = —0.1 and ng = 4. 



even though Eq. ( 24 1 passes through unrealistic morpholo- 
gies in this setup, it provides a lower limit to the nonlin- 
earity. For 100 equidistant values of a € [0, 1] we evaluate 
the mixed image 1(a) in pixel space and then estimate the 
concentration and asymmetry with respect to the maximum 
position. We also estimate the Gini coefficient. 

Figure[l2]shows the trajectories in the subspaces of C, A 
and Gini. Example objects Ji and I2 have very similar Sersic 
indices and flexion parameters, hence their transition pro- 
duces trajectories that are only moderately nonlinear. How- 
ever, example object ^3 is very different from I\ and I2 and 
thus its transitions produce trajectories that exhibit sub- 
stantial nonlinearities. Note that the nonlinearities in Fig. 
|12| are primarily induced by the lopsidedness via the asym- 
metry parameter, as is evident from the centre panel where 
A is not shown and virtually all nonlinearity is gone. 

We conclude from this simulation that for galaxy mor- 
phologies exhibiting realistic asymmetries the Euclidean dis- 
tance is a very poor approximation to distances in parameter 
space. Consequently, any algorithm based on Euclidean dis- 
tances would severely underestimate the true distances, i.e. 
objects would appear more similar than they actually are. 
This may be an explanation why the drop in the spectrum of 
eigenvalues of the principal components analysis of |Scarlata| 
et al. (20071 - which justifies the reduction of dimensional- 



ity - is not very decisive, ft may also partially account for 
the difficulty of recovering visual classifications using auto- 
mated algorithms (see e.g. Gauci et al. 2010[ ). This is no 
particular drawback of C, A and Gini, but applies to all 
other parametrisation schemes discussed here. It is highly 
questionable whether a "calibration" of the Euclidean dis- 
tance in order to account for the nonlinearity is possible. The 
reason for this is that, due to nonlinearity, the distance is 
an unknown function of the positions of both objects in pa- 




Figure 13. Discontinuity of concentration (a) and M20 (b). For 
poor sampling (small /?), concentration index and M20 exhibit 
substantial discontinuities. For better sampling (larger /3) the dis- 
continuities decrease. The transition was between two Sersic pro- 
files with rtg = 0.5 and ng = 2.0 and no intrinsic ellipcitity or 
lopsidedness. The scale radii were = 5 (blue lines) and = 15 
(red lines), respectively. The profiles were evaluated on a 300x300 
pixel grid. 



rameter space, i.e. the distance depends on the morphology. 
One possible solution is to try to estimate the true distance 



via a linear transformation as given by Eq. ( 24 1 , although 



that is computationally very expensive. Another option is to 
employ a method called "diffusion distance" ( Richards et al. 
2009) in order to estimate the true nonlinear distances. 



5.2.3 Discontinuity of spaces formed by C and M20 

In Fig. [13] we investigate the behaviour of concentration and 
M20 under a linear transformation between two Sersic pro- 
files. C and M20 exhibit substantial discontinuities due to 
pixellation effects. These effects increase for decreasing res- 
olution (i.e. decreasing /3 in Fig. |13[). 

In the case of C, the discontinuities occur because the 
radii containing 20% and 80% of the total image flux can 
only change in discrete steps. With increasing resolution, the 
pixel size decreases and the discontinuities of R20 and Rgo 
become smaller (cf. panels (a) and (b) in Fig. 13 1. Hence, 



this is not a problem for well resolved galaxies as in Fig. 
|12| However, it is a problem for poorly sampled galaxies. In 
this case, we can overcome this problem by interpolating the 
pixellised image and integrating numerically. Unfortunately, 
this would drastically increase the computational effort. In 
fact, the discontinuity of the concentration index has already 
been observed by |Lotz et al.| ( |2006| ). 

In the case of M20, the origin of the discontinuity is 
the sum over the second-order moments in the numerator 
of Eq. pi), which stops as soon as 20% of the total flux are 
reached. This threshold is the problem, as it causes the set of 
pixels fulfilling this criterion to change abruptly during the 
linear transformation. Again, the discontinuities of M20 de- 
crease with increasing resolution. However, for poorly sam- 
pled galaxies we cannot overcome these discontinuities by 
interpolation, since the definition of M20 only makes sense 
for pixellised images. 
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Figure 12. Trajectories in CA-Gini subspaces revealing substantial nonlinearities. Left panel: Trajectories in CA space. Centre panel: 
Trajectories in C-Gini space. Right panel: Trajectories in Gini-A space. In this simulation the nonlinearity is induced by the different 
lopsidedness of all objects (cf. Fig. |11| . The asymmetry is evaluated with respect to the maximum position, whereas the concentration 
is evaluated with respect to the centre of light. 



A parametrisation scheme forming discontinuous pa- 
rameter spaces is problematic, because it is not guaranteed 
that objects with similar morphologies end up in neigh- 
bouring regions of the parameter space. This implies that 
distances in the space formed e.g. by M20 do not neces- 
sarily correlate with the similarity of galaxy morphologies. 
We need similar morphologies to have smaller distances than 
dissimilar morphologies, but this is not guaranteed for C and 
M20 if the resolution is poor. Figure [13| suggests that such 
discontinuities become important when galaxies are smaller 
than 10 pixels in radius, maybe even earlier depending on 
the precise morphology. In this case, we even cannot rely on 
hard-cut classifications and it is questionable whether mean- 
ingful classification based on distances is possible at all. 



to employ a parametrisation scheme that produces a high- 
dimensional parameter space. Loosely speaking, it is better 
to start with too much information than with too little. We 
can overcome the curse of dimensionality, if we compress the 
parameter space, i.e. if we reduce its number of dimensions 
by identifying and discarding unimportant or redundant in- 
formation. For instance, |Kelly fc McKay| ( |2004| |2005[ ) ap- 
plied a principal component analysis in order to reduce the 
dimensionality of their parameter space[^] An alternative 
approach to overcome the curse of dimensionality is to em- 
ploy a kernel approach by describing the data using a sim- 
ilarity measure. We demonstrated in |Andrae et al.| ( |2010[ ) 
that this yields excellent results, e.g., allowing us to classify 
84 galaxies populating a 153-dimensional parameter space 
into three classes. 



5.3 High-dimensional parameter spaces 



Concerning classification, the current paradigm appears to 



favour low-dimensional parameter spaces (e.g. Scarlata et al. 
|2007[ ) that simplify the analysis or even allow a visual rep- 
resentation. However, we have to keep in mind that a high- 
dimensional parameter space may be necessary in order to 
differentiate between different groups of galaxy morpholo- 
gies. There is no physical reason to expect that a two- or 
even three-dimensional parameter space should be able to 
host such groups without washing out their differences. This 
solely depends on the complexity of the physics governing 
galaxy morphologies. 

In particular, basis-function expansions typically form 
parameter spaces of high dimensionality. For instance, the 
morphological parameter space used by |Kelly fc McKay] 
(20051 had 455 dimensions. Apart from problems with visu- 



alisation, we suffer from what is commonly called the curse 



of dimensionality (Bellman 19611: The hypervolume of a 



(parameter) space grows exponentially with its number of 
dimensions r_j Consequently, the density of data points in 
this parameter space is suppressed exponentially. Therefore, 
it is impossible to reliably model a data distribution in a 
parameter space of several hundred dimensions, no matter 
how much data is available. Nevertheless, it is preferable 



14 Consider a hypercube of edge length L in d dimensions. Its 
hypervolume L d grows exponentially with d. 



6 SUMMARY & CONCLUSIONS 

In this paper we have described and compared two differ- 
ent approaches to the parametrisation of galaxy morpholo- 
gies: First, model-independent schemes - CAS, Gini and 
M20. Second, model-based schemes - Sersic profiles and ba- 
sis functions. 

Our most important result is that morphological fea- 
tures (steepness of light profile, ellipticity, asymmetry, sub- 
structures, etc.) are intertwined and (at least some) cannot 
be estimated independently without introducing potentially 
serious biases. This intertwinement stems from the violation 
of one or more assumptions invoked by the parametrisation 
schemes. We emphasise that combining separate estimates 
of individual observables does not overcome the intertwine- 
ment. For instances, combining an ellipticity estimate and 
the fit of a circular Sersic profile does not give the same 
result as fitting an elliptical Sersic profile. No parametrisa- 
tion scheme discussed in this article accounts for all these 
observables simultaneously, i.e., their usage will inevitably 



15 Unfortunately, a principal component analysis (PCA) is a 
risky and often inappropriate tool in the context of classifica- 
tion. The reason is that PCA diagonalises the sample covariance 
matrix, i.e., it assumes that the whole data sample comes from a 
single Gaussian distribution. This assumption obviously jars with 
the goal of assigning objects to different classes. 
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cause problems when trying to parametrise large samples of 
galaxies that exhibit a huge variety of morphologies. 

In the context of classification of galaxy morphologies, 
which is an important application, we have the following 
results: 



• The intertwinement can wash out discriminative infor- 
mation in the context of classification. 

• All parametrisation schemes form nonlinear parame- 
ter spaces with a potentially highly nonlinear and unknown 
metric. Distance-based classification algorithms that employ 
the Euclidean distance measure therefore suffer from a loss 
of discriminative information. 

• For poorly resolved galaxies (object radius smaller than 
~10 pixels), concentration and M20 form discontinuous pa- 
rameter spaces that do not conserve neighbourhood relations 
of morphologies and may therefore fool classification algo- 
rithms. 

Due to the complexity of a nonlinear metric, it appears un- 
likely that calibrating results obtained from Euclidean dis- 
tance is possible. As we cannot expect to find a parametri- 
sation scheme that is linear in the data, a more promising 
approach is to estimate the nonlinear metric, e.g. via diffu- 



sion distances (Richards et al. 2009), or to use a classification 



algorithm that is not distance-based. An example for such 
an algorithm can be found in |Fraix-Burnet et al.| ( p009| l. 

6.1 Arguments in favour of model-based 
approaches 

In this paper we also collected arguments in favour of model- 
based approaches: 

• A (compact) model defines the term "centroid", i.e. 
whether we have to use the centre of light or the maximum 
position. 

• A model allows us to disentangle observables by 
marginalising the joint posterior distribution of all observ- 
ables. 

• A model allows us to assess reliability by providing 
residuals. 

• A model allows forward PSF modelling, which is more 
stable than backward modelling in the presence of pixel 
noise. 

Each of these arguments by itself disfavours model- 
independent approaches. Therefore, we conclude that 
schemes such as CAS, Gini and M20 are problematic for 
three reasons: 

(i) They try to measure morphological features inde- 
pendently ignoring their intertwinement (e.g. concentration 
does not account for asymmetry and vice versa). 

(ii) They do not provide residuals, i.e. we can neither as- 
sess reliability (to sort out failures for individual objects) 
nor marginalise. 

(iii) They do not allow forward PSF modelling, i.e. we 
may suffer from the instability of backward modelling, or, 
we need to introduce further assumptions via calibrations. 

Moreover, we have seen that robust implementations of CAS 
and M20 are neither easy nor computationally fast, since we 
have to consider centroid misestimations and - in the case 
of the concentration index - interpolation. 



We conclude that model-based parametrisation schemes 
are clearly superior. They provide reliable parametrisation 
schemes in all regimes of signal-to-noise ratios and resolu- 
tions. For low signal-to-noise ratios and low resolution the 



Sersic profile allows excellent parametrisations (e.g. Sargent 



et al. 20071. In the limit of high signal-to-noise ratios and 



high resolutions the method of shapelets is flexible enough to 



provide excellent model reconstructions (e.g. Andrae et al. 
|2010[ ). With the advent of sersiclets there will be another 
set of basis functions that is designed to provide even better 
parametrisations than shapelets ( Andrae et al.|in prep. 



6.2 Trade-offs 

Throughout this work we were facing two important trade- 
offs when comparing different parametrisation schemes for 
arbitrary galaxy morphologies, namely 

(i) simplicity vs. reliability and 

(ii) interpretation vs. flexibility. 

The first trade-off - simplicity vs. reliability - is obvi- 
ous. When dealing with large data samples, we have to find 
a parametrisation scheme that is not too expensive from a 
computational point of view. Apart from computational as- 
pects, we also favour simple solutions in general (Occam's 
razor). However, we have to beware of oversimplification 
which inevitably leads to unreliable results. The border- 
line between reasonable simplification and oversimplification 
should be defined by the data only and not by the researcher. 

The second trade-off - interpretation vs. flexibility - is 
at the heart of this article. We have seen that parametrisa- 
tion schemes that easily offer interpretation often lack flex- 
ibility (e.g. CAS), whereas other schemes (e.g. shapelets) 
excell in flexibility but lack interpretation. This is still an 
open issue and more work is needed on the interpretation of 
basis-function expansions. 

We should also add that there is actually no trade-off 
concerning computational feasibility. The parametrisation of 
samples of galaxies is trivial to parallelise, i.e. it can be done 
on numerous computers simultaneously. 



6.3 Recommendations and outlook 

We do not conclude that CAS, Gini and M20 should not 
be used anymore. According to their assumptions as given 
in Sect. [2741 these parametrisation schemes are highly spe- 
cialised on certain morphologies and their usage should be 
safe, if it is ensured that the sample of interest only con- 
tains galaxies of this special type. However, this obvious 
lack of flexibility renders these approaches inappropriate for 
general samples. Our most important recommendations for 
using CAS, Gini and M20 are as follows: 

• A PSF treatment is necessary at least in case of the 
concentration index. 

• Beware of undersampling effects in case of concentra- 
tion index and M20. Discontinuities can appear for objects 
of up to 10 pixels in radius. 

• Beware of the centroid ambiguity: Even for galaxies 
with realistic asymmetries the centre of light and maximum 
position do not coincide. In case of the concentration in- 
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dex, we recommend to fit for the centroid by maximising C, 



similar to the method of Conselice et al. (2000) 



Concerning the concentration index, we also recommend to 
use it only in the regime of intermediate signal-to-noise ra- 
tios and resolutions. The reasons is that its assumptions 



(Sect. 2.4 1 are almost identical to the assumptions of a Sersic 



profile. As a rule of thumb we can say that the concentra- 
tion index is reliable whenever the Sersic profile is a good 
description, and vice versa. 

Currently the most reliable parametrisation scheme is 
the two-dimensional Sersic profile enhanced by ellipticity, 
since it accounts for the steepness of the light profile and 
for ellipticity. These are definitely the two most important 
morphological observables. In the presence of asymmetries 
we recommend defining the centroid by fitting for the max- 
imum position of the profile rather than fixing it to the cen- 
tre of light. However, the Sersic profile does not account for 
asymmetry or substructures and is thus of limited usefulness 
for samples containing highly irregular galaxies and in the 
regime of high signal-to-noise ratios and high resolutions. 
Moreover, we have shown that the scale radius of the Sersic 
profile is difficult to interpret. In particular we have argued 
that the scale radii of profiles of different Sersic indices can- 
not be compared directly. We also demonstrated that a re- 
definition of the Sersic model may simplify the fitting pro- 
cedure and provide more robust parameter estimates. 

Our main conclusion is: None of the existing parametri- 
sation schemes is applicable to the task of parametrising 
arbitrary galaxy morphologies that occur in large samples, 
since they all have their drawbacks. Therefore, we need a 
new parametrisation scheme. Our recommendations for its 
design are as follows: 

(i) It should be model-based. 

(ii) It should estimate all relevant morphological features 
simultaneously. 

(iii) It should provide excellent model reconstructions of 
galaxies in the regime of high signal-to-noise ratios and high 
resolutions. 

(iv) It should form a metric parameter space such that it 
is possible to estimate distances for classification purposes. 

One possible solution is to modify e.g. the Sersic profile in 
order to account for asymmetries and substructures (Gal- 
fit 3, Peng et al. 20101. In our opinion basis functions are 



also promising candidates to describe arbitrary morpholo- 
gies, since they are highly flexible. However, current sets 
of basis functions still lack direct physical interpretation. 
Currently, we reinvestigate the method of sersiclets which 
appears to be the most promising approach given the con- 
siderations of this paper. 



ACKNOWLEDGEMENTS 

RA thanks Eric Bell for discussions that initialised this work. 
RA also thanks Matthias Bartelmann, Thorsten Lisker, 
Aday Robaina Rapisarda, Mark Sargent, Paraskevi "Vivi" 
Tsalmantza, Glenn van de Ven, and Katherine Inskip for 
helpful comments on the content of this paper. Further- 
more, we thank Claudia Scarlata for pointing out a mistake 
in an earlier version of this manuscript. RA is funded by a 



Klaus- Tschira scholarship. KJ is supported by the Emmy- 
Noether-programme of the DFG. PM is supported by the 
DFG Priority Programme 1177. 



REFERENCES 

Abazajian K. N., Adelman-McCarthy J. K., Agiieros M. A., 

et al., 2009, ApJS, 182, 543 
Abraham R. G., Tanvir N. R., Santiago B. X., Ellis R. S., 

Glazebrook K., van den Bergh S., 1996, MNRAS, 279, L47 
Abraham R. G., Valdes F., Yee H. K. C, van den Bergh 

S., 1994, ApJ, 432, 75 
Andrae R., Melchior P., Bartelmann M., 2010, ArXiv e- 

prints 

Andrae R., Melchior P., Jahnke K., in prep. 

Bamford S. P., Nichol R. C, Baldry I. K., et al., 2009, 

MNRAS, 393, 1324 
Bartelmann M., Schneider P., 2001, Phys. Rep., 340, 291 
Bellman R., 1961, Adaptive Control Processes: A Guided 

Tour, Princeton University Press 
Bershady M. A., Jangren A., Conselice C. J., 2000, AJ, 

119, 2645 

Conselice C. J., Bershady M. A., Jangren A., 2000, ApJ, 
529, 886 

Faber S. M., Willmer C. N. A., Wolf C, et al., 2007, ApJ, 
665, 265 

Ferreras I., Lisker T., Pasquali A., Kaviraj S., 2009, MN- 
RAS, 395, 554 

Fraix-Burnet D., Davoust E., Charbonnel C, 2009, MN- 
RAS, 398, 1706 

Fukugita M., Nakamura O., Okamura S., et al., 2007, AJ, 
134, 579 

Gauci A., Zarb Adami K., Abela J., Magro A., 2010, ArXiv 
e-prints 

Glasser C, 1962, Journal of the American Statistical As- 
sociation, 57, 648 
Goldberg D. M., Bacon D. J., 2005, ApJ, 619, 741 
Graham A. W., Driver S. P., 2005, Publications of the As- 
tronomical Society of Australia, 22, 118 
Haring N., Rix H., 2004, ApJ, 604, L89 
Kelly B. C, McKay T. A., 2004, AJ, 127, 625 
Kelly B. C, McKay T. A., 2005, AJ, 129, 1287 
Lisker T., 2008, ApJS, 179, 319 

Lotz J. M., Davis M., Faber S. M., et al., 2008, ApJ, 672, 
177 

Lotz J. M., Madau P., Giavalisco M., Primack J., Ferguson 

H. C, 2006, ApJ, 636, 592 
Lotz J. M., Primack J., Madau P., 2004, AJ, 128, 163 
MacKay D., 2008, Cambridge University Press 
Melchior P., Andrae R., Maturi M., Bartelmann M., 2009, 

A&A, 493, 727 
Melchior P., Bohnert A., Lombardi M., Bartelmann M., 

2010, A&A, 510, A75+ 
Melchior P., Meneghetti M., Bartelmann M., 2007, A&A, 

463, 1215 

Ngan W., van Waerbeke L., Mahdavi A., Heymans C, 

Hoekstra H., 2009, MNRAS, 396, 1211 
Peng C. Y., Ho L. C, Impey C. D., Rix H., 2010, AJ, 139, 

2097 

Refregier A., 2003, MNRAS, 338, 35 



18 R. Andrae, K. Jahnke & P. Melchior (2010) 



Richards J. W., Freeman P. E., Lee A. B., Schafer C. M., 

2009, ApJ, 691, 32 
Sargent M. T., Carollo C. M., Lilly S. J., et al., 2007, ApJS, 

172, 434 

Scarlata C, Carollo C. M., Lilly S., et al., 2007, ApJS, 172, 
406 

Scoville N., Abraham R. G., Aussel H., et al., 2007, ApJS, 
172, 38 

Sersic J. L., 1968, Atlas de galaxias australes, Cordoba, 
Argentina: Observatorio Astronomico, 1968 

Simmat E., Tuffs R. J., Popescu C. C, 2010, in American 
Institute of Physics Conference Series, vol. 1240, 87-88 

Strateva I., Ivezic Z., Knapp G. R., et al., 2001, AJ, 122, 
1861 

Strauss M. A., Weinberg D. H., Lupton R. H., et al., 2002, 
AJ, 124, 1810 

Trujillo I., Graham A. W., Caon N., 2001, MNRAS, 326, 
869 

Woo J., Treu T., Malkan M. A., Blandford R. D., 2006, 
ApJ, 645, 900 



and 

A, a = ( " 72a , ^ 72 ' 2 ) ■ (A9) 
V -72,2 271,2 - 72,1 / 

Using these matrices, we do not evaluate a flexed Sersic pro- 
file at position x — (xi,x 2 ), but rather at position 

x[ = Xi + ^D i3 kXjXk ■ (A10) 

The scaling of the coordinates by the scale radius /3 of the 
Sersic profile is applied prior to this flexion transformation. 

This paper has been typeset from a TgX/ I^TjrjX file prepared 
by the author. 



APPENDIX A: SHEAR AND FLEXION 
TRANSFORMATION 

We now briefly resume the shear and flexion transformation 
we are using to simulate ellipticity and lopsidedness - the 
latter being a special kind of asymmetry. 

Given the complex ellipticity, e = ei + ie 2 , with axis 
ratio q = | = ^~|^| and orientation angle 9 = | arctan(^), 
the "sheared" coordinates, (xi,x' 2 ), are given by 



x 1 \ _ f 1 - ea -£2 \ I xi 
X2 J I ~ £ 2 1 + ei J \ x 2 



(Al) 



For given pixel coordinates (x 1 ,X2), we then evaluate the 
model at (x'i, x' 2 )- 

The flexion transformation ( Goldberg fc Bacon|2005 1 is 
parametrised by the first flexion 

F = F!+ 1F2 (A2) 

and the second flexion 

G = Gi + iG 2 ■ (A3) 

Given these parameters, we compute the derivatives of the 
gravitational shear 7 = (71,72), 

7i.i = ^i + Gi) (A4) 



72,2 = i(Fi - Gi) (A5) 



71,2 = i(G2-F 2 ) (A6) 



72,1 = l(G 2 + F 2 ) . (A7) 
Based on these derivatives, we compute the two matrices 

D m =( - 2 TM-72,2 -72,1 \ (Ag) 

V -72,1 -72,2 I 



